Data Analysis

Basic Statistics

Training Data

Feature                                count       mean        std        min        25%        50%        75%        max
Width                                   3827   0.495535   0.143130   0.000000   0.401826   0.543379   0.547945   1.000000
Height                                  3827   0.444266   0.162976   0.000000   0.324009   0.475524   0.543124   1.000000
Elongation                              3827   0.029252   0.048309   0.000000   0.005131   0.014070   0.032787   1.000000
Roundness                               3827   0.762248   0.168430   0.000000   0.695407   0.812284   0.871475   1.000000
Complexity                              3827   0.159897   0.079812   0.000000   0.103656   0.142008   0.197451   1.000000
Symmetry                                3827   0.068838   0.070219   0.000000   0.036332   0.051809   0.076973   1.000000
Average Nearest Neighbor Distance       3827   0.055194   0.050244   0.000000   0.011574   0.049588   0.089671   1.000000
Maximum Nearest Neighbor Distance       3827   0.402168   0.104044   0.000000   0.356211   0.381495   0.448400   1.000000
Fractal Dimension                       3827   0.541963   0.172398   0.000000   0.383856   0.508433   0.702124   1.000000
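
The rows above (count, mean, std, min, quartiles, max) have the shape of a pandas `DataFrame.describe()` summary. The source does not show how the table was produced, so the following is only a minimal sketch under that assumption; the `train` frame and its values are hypothetical stand-ins, not the real dataset.

```python
import pandas as pd

# Hypothetical stand-in rows; the real training data is not part of this document.
train = pd.DataFrame(
    {
        "Width": [0.40, 0.54, 0.55, 1.00, 0.00],
        "Height": [0.32, 0.48, 0.54, 1.00, 0.00],
        "Roundness": [0.70, 0.81, 0.87, 1.00, 0.00],
    }
)

# describe() returns one column per feature and one row per statistic:
# count, mean, std, min, 25%, 50%, 75%, max.
summary = train.describe()

# Transposing puts one feature per row, which is easier to read
# when the feature names are long.
print(summary.T.round(6))
```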

Testing Data

Feature                                count       mean        std        min        25%        50%        75%        max
Width                                   3826   0.492044   0.147428   0.011416   0.401826   0.538813   0.547945   2.175799
Height                                  3826   0.436596   0.160751  -0.016317   0.317016   0.468531   0.543124   0.951049
Elongation                              3826   0.029093   0.043377   0.000000   0.005237   0.014260   0.032937   0.509477
Roundness                               3826   0.760377   0.167911   0.049244   0.692080   0.811166   0.868036   1.000653
Complexity                              3826   0.157623   0.078726  -0.004674   0.103793   0.140807   0.194366   1.114948
Symmetry                                3826   0.070555   0.074900  -0.026459   0.036808   0.052061   0.077464   0.964708
Average Nearest Neighbor Distance       3826   0.054787   0.047324   0.000314   0.010692   0.048464   0.089701   0.276530
Maximum Nearest Neighbor Distance       3826   0.398249   0.104094  -0.003401   0.355833   0.379703   0.443317   1.705619
Fractal Dimension                       3826   0.540884   0.170935   0.080464   0.384223   0.506129   0.695504   1.016889
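
Every training feature spans exactly 0 to 1, while several testing features fall slightly below 0 or above 1. One pattern that would produce this is min-max scaling whose bounds are learned from the training data only and then reused on the testing data. The source does not state that this is what was done, so the sketch below is an illustration of that assumption; the function names and raw values are hypothetical.

```python
def fit_min_max(values):
    """Return the (min, max) bounds learned from the training values."""
    return min(values), max(values)


def apply_min_max(values, lo, hi):
    """Scale values with previously learned bounds (assumes hi > lo).

    Values outside the training range map outside [0, 1].
    """
    span = hi - lo
    return [(v - lo) / span for v in values]


# Hypothetical raw measurements in arbitrary units.
train_raw = [12.0, 30.0, 45.0, 60.0]
test_raw = [10.0, 33.0, 72.0]

lo, hi = fit_min_max(train_raw)                   # bounds come from training only
train_scaled = apply_min_max(train_raw, lo, hi)   # spans exactly 0 to 1
test_scaled = apply_min_max(test_raw, lo, hi)     # may go below 0 or above 1

print(train_scaled)
print(test_scaled)
```

Reusing the training bounds keeps both splits on the same scale, at the cost of testing values that can leave the unit interval.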

Scatter Plot

Using Matplotlib

[Figure: Scatter Plot]

Feature Histograms

[Figure: Histograms]

Correlation Matrix Heatmap

[Figure: Correlation Heatmap]
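
A correlation heatmap of this kind can be sketched with NumPy and Matplotlib as follows. The source does not show its plotting code, so the helper name `plot_correlation_heatmap`, the colour map, and the synthetic data are all assumptions made for illustration.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np


def plot_correlation_heatmap(data, names):
    """Draw a Pearson correlation heatmap for the columns of `data`."""
    corr = np.corrcoef(data, rowvar=False)  # rowvar=False: each column is a feature
    fig, ax = plt.subplots(figsize=(6, 5))
    image = ax.imshow(corr, cmap="coolwarm", vmin=-1.0, vmax=1.0)
    ax.set_xticks(range(len(names)))
    ax.set_yticks(range(len(names)))
    ax.set_xticklabels(names, rotation=45, ha="right")
    ax.set_yticklabels(names)
    fig.colorbar(image, ax=ax, label="Pearson r")
    fig.tight_layout()
    return fig, corr


# Synthetic stand-in data: Height is built to correlate with Width.
rng = np.random.default_rng(0)
width = rng.random(200)
height = 0.8 * width + 0.2 * rng.random(200)
roundness = rng.random(200)
demo = np.column_stack([width, height, roundness])

fig, corr = plot_correlation_heatmap(demo, ["Width", "Height", "Roundness"])
fig.savefig("correlation_heatmap.png")
```

Fixing the colour range to [-1, 1] keeps zero correlation at the midpoint of the colour map, so positive and negative correlations stay visually comparable across plots.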

Using Plotly

Additional Visualizations

Box Plots

[Figure: Box Plots]

Pair Plot

[Figure: Pair Plot]

Count Plot of Labels

[Figure: Count Plot]

3D Scatter Plot

Correlation Matrix Heatmap (Selected Features)

[Figure: Correlation Heatmap (Selected Features)]

More Visualizations

Feature Distributions by Label

[Figure: Feature Distributions]

Feature Distributions by Label

[Figure: Feature Distributions]

Pie Chart of Label Distribution

[Figure: Pie Chart]

3D Scatter Plot (Roundness vs Complexity)

PCA Visualization

[Figure: PCA Visualization]
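
For reference, a two-component PCA projection like the one visualized here can be computed with a singular value decomposition. This is a minimal sketch, not the code behind the figure: the function name `pca_project` is hypothetical, the input is random stand-in data with nine columns to mirror the nine features, and the original analysis may have used a library implementation instead.

```python
import numpy as np


def pca_project(data, n_components=2):
    """Project rows of `data` onto the top principal components.

    Returns the projected points and the explained variance ratio
    of each kept component.
    """
    centered = data - data.mean(axis=0)  # PCA requires mean-centered features
    # Rows of vt are the principal axes, ordered by decreasing singular value.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    projected = centered @ components.T
    explained = singular_values**2 / np.sum(singular_values**2)
    return projected, explained[:n_components]


# Random stand-in data: 100 samples, 9 features.
rng = np.random.default_rng(0)
demo = rng.random((100, 9))

projected, ratios = pca_project(demo)
print(projected.shape)
print(ratios)
```

The two columns of `projected` can then be passed to a scatter plot, typically coloured by label.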